Methods and systems for processing a log file

ABSTRACT

Disclosed are methods, apparatus, systems, and computer program products for provisioning log file data. A server may parse through log files by selecting particular log entries and data fields. Customer-facing log files may be generated based on the selected log entries and data fields.

PRIORITY DATA

This patent document claims priority to co-pending and commonly assigned U.S. Provisional Patent Application No. 61/898,858, titled “System and Method for Providing Log Files Data to End Users,” by Torman, et al., filed on Nov. 1, 2013, which is hereby incorporated by reference in its entirety and for all purposes.

COPYRIGHT NOTICE

A portion of the disclosure of this patent document contains material, which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.

TECHNICAL FIELD

This patent document relates generally to providing log file data to users of a multi-tenant database system, and more specifically, to parsing log files of the multi-tenant database system to generate customer-facing log files.

BACKGROUND

“Cloud computing” services provide shared resources, software, and information to computers and other devices upon request. In cloud computing environments, software can be accessible over the Internet rather than installed locally on in-house computer systems. Cloud computing typically involves over-the-Internet provision of dynamically scalable and often virtualized resources. Technological details can be abstracted from the users, who no longer have need for expertise in, or control over, the technology infrastructure “in the cloud” that supports them.

BRIEF DESCRIPTION OF THE DRAWINGS

The included drawings are for illustrative purposes and serve only to provide examples of possible structures and operations for the disclosed inventive systems, apparatus, methods and computer program products for processing a log file. These drawings in no way limit any changes in form and detail that may be made by one skilled in the art without departing from the spirit and scope of the disclosed implementations.

FIG. 1 shows a system diagram of an example of architectural components 100 for provisioning log file data according to some implementations.

FIG. 2 shows a flowchart of an example of provisioning log file data.

FIG. 3A shows an example of parsing log files of a multi-tenant database system to generate customer-facing log files.

FIG. 3B shows an example of multiple log entries associated with events according to some implementations.

FIG. 4 shows an example of an operating environment according to some implementations.

FIG. 5A shows a block diagram of an example of an environment 10 in which an on-demand database service can be used in accordance with some implementations.

FIG. 5B shows a block diagram of an example of some implementations of elements of FIG. 5A and various possible interconnections between these elements.

FIG. 6A shows a system diagram illustrating an example of architectural components of an on-demand database service environment 1200 according to some implementations.

FIG. 6B shows a system diagram further illustrating an example of architectural components of an on-demand database service environment according to some implementations.

DETAILED DESCRIPTION

Examples of systems, apparatus, and methods according to the disclosed implementations are described in this section. These examples are being provided solely to add context and aid in the understanding of the disclosed implementations. It will thus be apparent to one skilled in the art that implementations may be practiced without some or all of these specific details. In other instances, certain process/method operations, also referred to herein as “blocks,” have not been described in detail in order to avoid unnecessarily obscuring implementations. Other applications are possible, such that the following examples should not be taken as definitive or limiting either in scope or setting.

In the following detailed description, references are made to the accompanying drawings, which form a part of the description and in which are shown, by way of illustration, specific implementations. Although these implementations are described in sufficient detail to enable one skilled in the art to practice the disclosed implementations, it is understood that these examples are not limiting, such that other implementations may be used and changes may be made without departing from their spirit and scope. For example, the blocks of methods shown and described herein are not necessarily performed in the order indicated. It should also be understood that the methods may include more or fewer blocks than are indicated. In some implementations, blocks described herein as separate blocks may be combined. Conversely, what may be described herein as a single block may be implemented in multiple blocks.

Various implementations described or referenced herein are directed to different systems, apparatus, methods, and computer-readable storage media for provisioning log file data. For example, a database service can serve multiple tenants running applications on the database service (i.e., a multi-tenant database service). Generally, the actions of users (e.g., the clients of the tenants) when using the applications may generate data that is often saved in log files that may be used to understand the activities of the system, debug issues, and perform other analytical tasks such as roll-up adoption metrics. The multi-tenant database service is usually designed to store the data in log files so that a system administrator of the database service can later analyze it.

In some instances, the tenants may want to look at some of the data in the log files to perform their own analysis on the data. For example, a tenant may want to access log file data indicating user login activities to determine where users (i.e., clients of the tenant) are logging into applications from and plot the locations on a map. However, the log file created by the multi-tenant database service includes the data of multiple tenants. The system administrator may not want one tenant to have access to data associated with another tenant's clients. Moreover, the system administrator may set up the log file to generate additional data, such as system performance details or other internal metrics, related to the tenants that should not be shared with any tenants. Accordingly, a server may parse through the log file generated by the multi-tenant database service and create customer-facing log files for each of the tenants. Some types of log entries as well as some types of data fields of the log entries can be excluded from the customer-facing log files.

For example, the system administrator may set up the multi-tenant database system to generate log entries in one or more log files for activities such as logins to applications running on the system, application programming interface (API) events (e.g., when and how the API is used), file or report downloads, and user interface clicks (e.g., clicking on a button of a user interface provided by an application running on the system). A tenant may be interested in the log entries related to logins, downloads, and user interface clicks in order to determine how clients are accessing their applications, what the clients are downloading, and what features are being used by the clients. The system administrator may be interested in those log entries as well as API events. However, the system administrator may not want to provide the tenant the log entries associated with the API events because the data may be too detailed, may provide specific data regarding the performance of the system, or may include other details that the tenant does not need. Additionally, some log entry fields of the log entries may also provide data that the system administrator does not want to provide to the tenants.

As such, the system administrator can specify that only particular types of log entries may be provided to the tenants. Furthermore, the system administrator can specify that only particular types of fields of the log entries may be provided to the tenants. The system can parse through the log files and generate customer-facing log files with log entries specific to that tenant's applications and with the fields of the log entries approved by the system administrator. Accordingly, a large amount of log file data may be reduced such that each tenant receives a smaller and tailored amount of log file data so that the tenants receive more meaningful log file data.

These and other implementations may be embodied in various types of hardware, software, firmware, and combinations thereof. For example, some techniques disclosed herein may be implemented, at least in part, by computer-readable media that include program instructions, state information, etc., for performing various services and operations described herein. Examples of program instructions include both machine code, such as produced by a compiler, and files containing higher-level code that may be executed by a computing device such as a server or other data processing apparatus using an interpreter. Examples of computer-readable media include, but are not limited to, magnetic media such as hard disks, floppy disks, and magnetic tape; optical media such as CD-ROM disks; magneto-optical media; and hardware devices that are specially configured to store program instructions, such as read-only memory (“ROM”) devices and random access memory (“RAM”) devices. These and other features of the disclosed implementations will be described in more detail below with reference to the associated drawings.

In some but not all implementations, the disclosed methods, apparatus, systems, and computer-readable storage media may be configured or designed for use in a multi-tenant database environment.

The term “multi-tenant database system” can refer to those systems in which various elements of hardware and software of a database system may be shared by one or more customers. For example, a given application server may simultaneously process requests for a great number of customers, and a given database table may store rows of data such as feed items for a potentially much greater number of customers. The term “query plan” generally refers to one or more operations used to access information in a database system.

A “user profile” or “user's profile” is generally configured to store and maintain data about a given user of the database system. The data can include general information, such as name, title, phone number, a photo, a biographical summary, and a status, e.g., text describing what the user is currently doing. As mentioned below, the data can include messages created by other users. Where there are multiple tenants, a user is typically associated with a particular tenant. For example, a user could be a salesperson of a company, which is a tenant of the database system that provides a database service.

The term “record” generally refers to a data entity, such as an instance of a data object created by a user of the database service, for example, about a particular (actual or potential) business relationship or project. The data object can have a data structure defined by the database service (a standard object) or defined by a user (custom object). For example, a record can be for a business partner or potential business partner (e.g., a client, vendor, distributor, etc.) of the user, and can include information describing an entire company, subsidiaries, or contacts at the company. As another example, a record can be a project that the user is working on, such as an opportunity (e.g., a possible sale) with an existing partner, or a project that the user is trying to get. In one implementation of a multi-tenant database system, each record for the tenants has a unique identifier stored in a common table. A record has data fields that are defined by the structure of the object (e.g., fields of certain data types and purposes). A record can also have custom fields defined by a user. A field can be another record or include links thereto, thereby providing a parent-child relationship between the records.

FIG. 1 shows a system diagram of an example of architectural components 100 for provisioning log file data according to some implementations. Architectural components 100 may provide communications to be transmitted among a variety of different hardware and/or software components. In FIG. 1, architectural components 100 include log server 105, user system 110 a, user system 110 b, customer-facing log files 115, parsing data 117, application server 120, and log files 125. In other implementations, the functionality in the architectural components may be implemented in more or fewer servers.

User systems 110 a and 110 b may be any type of computing device. For example, user systems 110 a and 110 b may be portable electronic devices such as smartphones, tablets, laptops, wearable devices (e.g., smart watches), etc. User systems 110 a and 110 b may be another server or a desktop computer. Additionally, user systems 110 a and 110 b may be different types of computing devices. For example, user system 110 a may be a desktop computer whereas user system 110 b may be a smartphone. In some implementations, user systems 110 a and/or 110 b may be an integration service.

In some implementations, application server 120 may include applications used by different tenants of application server 120. As each client of each tenant interacts with the applications, log entries corresponding to the interactions may be generated and saved in log files in log files database 125, which may be a content management system, document repository, database or other storage mechanism for log files. In some implementations, log files 125 may be integrated within application server 120.

For example, if a tenant's client logs into an application, a corresponding log entry may be stored in a log file in log files 125. The log entry may include a variety of data such as a tenant ID (i.e., a unique identifier associated with the tenant), event type (i.e., a login), location (i.e., the geographic location from which the client logged into the application), timestamp (i.e., when the login occurred), and internal system information (e.g., a server load associated with the login). If another client of another tenant logs into the application, another log entry may be stored in the same log file.

As another example, if a client downloads a file, another log entry may be generated in the same log file or in another log file in log files database 125. The new log entry may include data such as the tenant ID, event type (i.e., a download), timestamp (i.e., when the event, or download, occurred), the file name of the downloaded file, and internal system information (e.g., the bandwidth used by the system to provide the download).
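As a minimal sketch of the two kinds of log entries described above, the following Python fragment models a login entry and a download entry as dictionaries. The field names (e.g., tenant_id, bandwidth_kbps) and all values are hypothetical and chosen only for illustration; no particular storage format is implied.

```python
# Hypothetical in-memory representation of raw log entries.
# Field names and values are illustrative assumptions, not a disclosed format.
login_entry = {
    "tenant_id": "T1",
    "event_type": "login",
    "location": "San Francisco",
    "timestamp": "2013-11-01T02:15:00Z",
    "server_load": 0.42,        # internal system information
}

download_entry = {
    "tenant_id": "T2",
    "event_type": "download",
    "timestamp": "2013-11-01T02:16:30Z",
    "file": "report.pdf",
    "bandwidth_kbps": 850,      # internal system information
}

# Entries from different tenants may be co-mingled in the same log file.
log_file = [login_entry, download_entry]
tenants_in_file = sorted({entry["tenant_id"] for entry in log_file})
print(tenants_in_file)
```

In this sketch, a single list stands in for one log file that holds entries from more than one tenant.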

At a particular time, for example at 2 A.M. or during other periods of low activity, application server 120 may obtain the log files from log files 125 and provide the log files to log server 105.

Log server 105 may receive the log files from application server 120 and use data from parsing data 117 to parse through the log files and generate customer-facing log files for each of the tenants to be stored in customer-facing log files database 115. In some implementations, log server 105 may directly store the customer-facing log files in customer-facing log files database 115. In other implementations, application server 120 may receive the customer-facing log files from log server 105 and then store them in customer-facing log files database 115. That is, customer-facing log files database 115 may include log files specific for each tenant based on the log files provided by application server 120 from log files database 125 and parsing data 117. Accordingly, co-mingled data associated with multiple tenants may be split into separate log files.

For example, the log files in log files database 125 may each include log entries associated with different events. Additionally, each log entry may include a variety of fields associated with the event. As an example, as previously discussed, a download event type log entry may include fields providing data such as the tenant ID, event type, timestamp, file name, and bandwidth information. Parsing data 117 may indicate which types of log entries (e.g., log entries associated with download events) may be used to generate the customer-facing log files stored in customer-facing log files database 115. Additionally, parsing data 117 may indicate which types of fields of the log entries (e.g., event type, timestamp, and file name) may be used to generate the customer-facing log files. That is, parsing data 117 may indicate data fields that can be represented in the customer-facing log files and/or the fields that may be purposefully kept away from the tenants. Accordingly, log server 105 receives log files from application server 120 and parses through and pares down the data in log files to generate smaller, customer-facing log files. Log server 105 or application server 120 may store the customer-facing log files in customer-facing log files 115, which may be a database or other type of storage.
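The role of parsing data 117 might be sketched as follows, assuming it can be reduced to a mapping from approved entry types to approved field names. The names PARSING_DATA and pare_down, and the mapping layout itself, are hypothetical.

```python
# Hypothetical stand-in for parsing data 117: each approved entry type maps
# to the fields that may appear in customer-facing log files.
PARSING_DATA = {
    "download": ["event_type", "timestamp", "file"],
}

def pare_down(entry, parsing_data):
    """Return a customer-facing copy of entry, or None if its type is not approved."""
    approved_fields = parsing_data.get(entry["event_type"])
    if approved_fields is None:
        return None
    return {name: entry[name] for name in approved_fields if name in entry}

raw_download = {
    "tenant_id": "T2",
    "event_type": "download",
    "timestamp": "2013-11-01T02:16:30Z",
    "file": "report.pdf",
    "bandwidth": 850,
}
raw_api = {"tenant_id": "T2", "event_type": "api", "timestamp": "2013-11-01T02:17:00Z"}

print(pare_down(raw_download, PARSING_DATA))
print(pare_down(raw_api, PARSING_DATA))
```

Here the download entry keeps only the event type, timestamp, and file name, while the API entry is dropped because its type is not listed.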

As such, user system 110 a and user system 110 b may be able to access their respective log files via application server 120. For example, user system 110 a may be able to access its own customer-facing log file providing details based on the actions of its clients from log server 105. Likewise, user system 110 b may be able to access its own customer-facing log file providing details of its clients from log server 105. As such, a subset of the data from log files of a multi-tenant database system may be provided to the corresponding individual tenants.

FIG. 2 shows a flowchart of an example of provisioning log file data. Method 200 (and other methods described herein) may be implemented by the architectural components of FIG. 1. In various implementations, blocks may be reordered, omitted, combined, or split into additional blocks for method 200, as well as other methods described herein.

In block 202, log entries may be generated by a multi-tenant system. In the multi-tenant system, the clients (or users) of the tenants using the applications hosted by the multi-tenant system may be performing actions that may result in a log entry being generated in a log file hosted by the system. Each log entry may be a line in the log file (i.e., an application log line).

For example, in FIG. 3A, log file 305 may include four log entries: log entry A, log entry B, log entry C, and log entry D. Each of the four log entries in log file 305 may be generated when a client of a tenant logs into an application hosted by the multi-tenant system. As an example, log entries A and C may be created when the clients of a tenant log into an application. In particular, the system may capture or determine that the event occurred and then generate the log entry. Log entries B and D may be created in the same log file 305 when different clients of a different tenant log into an application. Accordingly, a single log file may include log entries from the clients of multiple tenants. That is, the log file may include data associated with different tenants of the multi-tenant system.

Each log entry generated upon a client logging in includes five data fields: tenant ID, event type, location, timestamp, and server load, as depicted for log entry D in FIG. 3A. Tenant ID may be an identifier (e.g., a unique number) associated with the tenant. Different clients of different tenants may generate different corresponding tenant IDs in the respective log entries. For example, log entry A and log entry C may both have the same tenant ID because the client actions are associated with the same tenant. Likewise, log entry B and log entry D may both have another tenant ID because the client actions are associated with another tenant. The event data field may indicate the event type or user action that resulted in the generation of the log entry, for example “login” in the example of FIG. 3A. The location data field of the log entries in log file 305 may indicate a geographic location (e.g., country, state, city, etc.) from which the clients are logging in. The timestamp data field may indicate when the client action (i.e., logging in) was performed. The server load data field may indicate the server load at the time when the client action was performed.

Log file 310 includes five log entries: log entry E, log entry F, log entry G, log entry H, and log entry X. Each of the five log entries in log file 310 may also be generated upon a client action by the multi-tenant system, similar to log file 305. However, rather than each log entry in log file 310 being generated upon a client logging into an application, log entries E-H of the log entries in log file 310 may be generated upon a client downloading a file and log entry X may be generated upon a client using an API. Accordingly, log file 310 includes co-mingled data from multiple tenants as well as co-mingled log entries of different types (e.g., download and API event types). Each of the log entries E-H in log file 310 includes five data fields: tenant ID, event type, timestamp, file, and bandwidth. Tenant ID may also indicate the particular tenant associated with the client performing the action that results in the generated log entries. Event type may also indicate the type of action that led to the generation of the log entry, for example, “download.” Similar to log file 305, timestamp may also be the time when the action (i.e., downloading a file) was performed. The file data field may indicate the name of the file that was downloaded by the client. Lastly, the bandwidth data field may indicate the bandwidth used by the system to allow for the client to download the file. By contrast, log entry X may include different data fields than log entries E-H because log entry X is for a different event type (i.e., an API use).
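The co-mingling in log file 310 can be illustrated with a short Python sketch that counts entries per tenant and event type. The tenant identifiers "T1" and "T2", the assignment of log entry X to "T1", and the "entry" label field are assumptions made only for this example.

```python
from collections import Counter

# Hypothetical contents of log file 310; only the fields needed here are shown.
log_file_310 = [
    {"entry": "E", "tenant_id": "T1", "event_type": "download"},
    {"entry": "F", "tenant_id": "T1", "event_type": "download"},
    {"entry": "G", "tenant_id": "T2", "event_type": "download"},
    {"entry": "H", "tenant_id": "T2", "event_type": "download"},
    {"entry": "X", "tenant_id": "T1", "event_type": "api"},  # tenant assumed
]

# Count how many entries each (tenant, event type) pair contributes.
counts = Counter((e["tenant_id"], e["event_type"]) for e in log_file_310)
for key in sorted(counts):
    print(key, counts[key])
```

More than one tenant and more than one event type appear in the counts, which is the sense in which the file is co-mingled.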

In block 205, log files may be retrieved and shipped by the server. For example, upon a time when the system is relatively idle, log files (e.g., log 305 and log 310) may be transferred from the application server (e.g., application server 120) to a log server (e.g., log server 105) to process the log files. As an example, FIG. 4 shows an operating environment according to some implementations. In FIG. 4, log shipper 405 may retrieve log files 125 from application server 120 and “ship” the logs to log server 105. In some implementations, log shipper 405 may be a separate server, part of application server 120, log server 105, etc. such that it allows for log files to be provided to log server 105.

Accordingly, in block 210, the log files may be received by log server 105. In some implementations, the log files may be stored by the log server and then analyzed and/or parsed as disclosed herein. However, in other implementations, the log files may be analyzed and parsed on-the-fly as the log files are received by log server 105 from log shipper 405.

In block 215, a subset of the log entries of the log files may be selected. In some implementations, log server 105 may include map reduce logic 410 for selecting the subset of the log entries. In particular, map reduce logic 410 may analyze the received log files 125 and determine which log entries should be provided to particular clients.

As an example, over 200 types of log entries generated upon different events may be in a log file. However, the system administrator may only want the tenants to receive log lines that correspond to 28 approved entry types. For example, log entries corresponding to logins and downloads (e.g., log entries A-D in log 305 and E-H in log 310) may be approved, but log entries corresponding to API events (e.g., log entry X in log 310) may not be approved because the associated log entries are mainly for internal purposes that should not be provided to the tenants. Accordingly, the system administrator may designate approved log entries, for example, in an extensible markup language (XML) data file with designations or attributions of the approved types of entries. The XML data file may be provided to map reduce logic 410 as a system selection indicating the approved log entries.
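One way the XML designation of approved entry types might look is sketched below in Python. The element and attribute names (approvedEntries, entryType, name) are hypothetical; no schema is specified for the XML data file.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML data file listing approved entry types (schema assumed).
APPROVED_XML = """
<approvedEntries>
  <entryType name="login"/>
  <entryType name="download"/>
</approvedEntries>
"""

def load_approved_types(xml_text):
    """Parse the XML text and return the set of approved entry type names."""
    root = ET.fromstring(xml_text.strip())
    return {node.get("name") for node in root.findall("entryType")}

def select_entries(entries, approved_types):
    """Keep only entries whose event type is approved (block 215)."""
    return [e for e in entries if e["event_type"] in approved_types]

approved_types = load_approved_types(APPROVED_XML)
entries = [
    {"tenant_id": "T1", "event_type": "login"},
    {"tenant_id": "T2", "event_type": "download"},
    {"tenant_id": "T1", "event_type": "api"},
]
selected = select_entries(entries, approved_types)
print([e["event_type"] for e in selected])
```

The API entry is not selected because its type does not appear in the XML data.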

In some implementations, block 215 may be initiated by CRON 415 at a time after log shipper 405 provides the log files to log server 105. For example, CRON 415 may be a time-based job scheduler set up to inform map reduce 410 when to begin. In some implementations, if log shipper 405 transfers logs from application server 120 to log server 105 at 2 A.M., then CRON 415 may inform map reduce 410 to begin at 3 A.M. (i.e., a time later than the time when log shipper 405 provides the log files to log server 105). In some implementations, the time CRON 415 may inform map reduce 410 to begin may be based on how many logs log shipper 405 needs to provide to log server 105. For example, a higher number of logs needing to be provided may have a later map reduce 410 start time than a lower number of logs.

At block 220, a subset of data fields of the subset of entries may be selected.

In particular, map reduce logic 410 may also reduce the number of data fields of the log entries that were selected in block 215. Similar to the system administrator designating approved log entries, particular data fields may also be approved, for example, in the same or another XML data file with designations or attributions of the approved data fields.
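The field selection of block 220 could be sketched in Python as follows, assuming the approved data fields are listed per entry type in an XML data file. The XML element names and the helper functions are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML data file listing approved data fields per entry type.
FIELDS_XML = """<approvedFields>
  <entryType name="login">
    <field name="event_type"/>
    <field name="location"/>
    <field name="timestamp"/>
  </entryType>
</approvedFields>"""

def load_approved_fields(xml_text):
    """Return a mapping of entry type name to its list of approved field names."""
    root = ET.fromstring(xml_text)
    return {
        node.get("name"): [f.get("name") for f in node.findall("field")]
        for node in root.findall("entryType")
    }

def select_fields(entry, approved_fields):
    """Keep only the approved fields of an already-selected entry (block 220)."""
    names = approved_fields.get(entry["event_type"], [])
    return {name: entry[name] for name in names if name in entry}

approved_fields = load_approved_fields(FIELDS_XML)
entry_d = {
    "tenant_id": "T2",
    "event_type": "login",
    "location": "US",
    "timestamp": "2013-11-01T02:15:00Z",
    "server_load": 0.42,
}
print(select_fields(entry_d, approved_fields))
```

In this sketch the tenant ID and server load fields are omitted from the result because they are not listed as approved.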

At block 225, customer-facing log files may be generated. In particular, customer-facing log files corresponding to the tenants may be generated based on the selected log entries and data fields from blocks 215 and 220.

For example, in FIG. 3A, log files 315, 320, 325, and 330 may be customer-facing log files with data corresponding to the selected log entries and data fields. That is, log files 315, 320, 325, and 330 combined may include a subset of the total data of log files 305 and 310. Log file 315 includes log entries A and C. Log file 320 includes log entries B and D. Log file 325 includes log entries E and F. Log file 330 includes entries G and H. The entries may be included in the log files 315, 320, 325, and 330 because the entries were selected in block 215. Four customer-facing log files (i.e., log files 315, 320, 325, and 330) may be created from the two log files 305 and 310 because each tenant may receive a log file without any entries associated with another tenant (e.g., a log entry generated based on an action by another tenant's client). For example, in FIG. 3A, log files 315 and 325 may include actions of one tenant's clients whereas log files 320 and 330 may include actions of another tenant's clients.

In FIG. 3A, each tenant may have two customer-facing log files generated because two log files 305 and 310 were parsed at blocks 215 and 220. However, in other implementations, the data in log files 315 and 325 may be provided in a single customer-facing log file. Likewise, the data in log files 320 and 330 may also be provided in a single customer-facing log file.

In FIG. 3A, log entry X from log file 310 is not provided in any of the customer facing log files 315, 320, 325, and 330 because, as previously discussed, log entry X may be associated with an event type that is not approved to be provided to the tenants, and therefore, would not be selected in block 215.

Additionally, in FIG. 3A, the log entries in log files 315, 320, 325, and 330 may also include fewer data fields than the corresponding entries in log files 305 and 310 based on block 220. In FIG. 3A, log entry D in log file 320 includes fewer data fields than log entry D in log file 305. In particular, tenant ID and server load data fields are not provided in the customer-facing log file because they were not specified as being approved, as previously discussed. Likewise, log entry E in log file 325 also includes fewer data fields than log entry E in log file 310 for similar reasons.
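The overall transformation of FIG. 3A, from two co-mingled log files to four pared-down customer-facing log files, might be sketched in Python as below. The tenant identifiers, field values, and function names are hypothetical, and a plain loop stands in for map reduce logic 410.

```python
from collections import defaultdict

# Hypothetical approvals: entry types and fields the administrator allows.
APPROVED = {
    "login": ["event_type", "location", "timestamp"],
    "download": ["event_type", "timestamp", "file"],
}

def login(tenant_id):
    return {"tenant_id": tenant_id, "event_type": "login", "location": "US",
            "timestamp": "2013-11-01T02:00:00Z", "server_load": 0.4}

def download(tenant_id):
    return {"tenant_id": tenant_id, "event_type": "download",
            "timestamp": "2013-11-01T02:05:00Z", "file": "report.pdf",
            "bandwidth": 850}

# Source log files modeled on FIG. 3A; tenant IDs "T1" and "T2" are assumed.
SOURCE_LOGS = {
    "305": [login("T1"), login("T2"), login("T1"), login("T2")],  # A, B, C, D
    "310": [download("T1"), download("T1"),                       # E, F
            download("T2"), download("T2"),                       # G, H
            {"tenant_id": "T1", "event_type": "api",              # X
             "timestamp": "2013-11-01T02:06:00Z"}],
}

def generate_customer_facing(source_logs, approved):
    """Split co-mingled logs into one pared-down log per (tenant, source log)."""
    output = defaultdict(list)
    for source_name, entries in source_logs.items():
        for entry in entries:
            fields = approved.get(entry["event_type"])
            if fields is None:  # entry type not approved (e.g., log entry X)
                continue
            pared = {name: entry[name] for name in fields if name in entry}
            output[(entry["tenant_id"], source_name)].append(pared)
    return dict(output)

customer_logs = generate_customer_facing(SOURCE_LOGS, APPROVED)
print(sorted(customer_logs))
```

Under these assumptions the four resulting keys correspond to log files 315, 320, 325, and 330, each holding two entries without tenant ID, server load, or bandwidth fields.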

In some implementations, activities of user systems 110 a or 110 b may generate multiple log entries associated with events. FIG. 3B shows an example of multiple log entries associated with events according to some implementations. In FIG. 3B, the shaded data is excluded from the customer-facing log files. The non-shaded data is included in the customer-facing log files.

In FIG. 3B, three different actions (i.e., logins, Indexing, and URI) may be performed by user system 110 a, captured, and corresponding log entries may be generated. As previously discussed, some entries associated with certain event types (e.g., Indexing in FIG. 3B) may not be provided in the customer-facing log files, and therefore are shaded in FIG. 3B. Log entries associated with Logins and URI may be in the customer-facing log files, and therefore, are not shaded. However, some types of events may generate multiple log entries. For example, in FIG. 3B, a URI event generates three log entries all of the same entry type (e.g., URI): Cascading Style Sheets (CSS) (e.g., for accessing a CSS file), JavaServer Pages (JSP) (e.g., for accessing a JSP file), and JavaScript (JS) (e.g., for accessing a JS file). These “sub-entries” may further be indicated by the system administrator as being approved (e.g., in the same XML file as previously discussed) to be in the customer-facing log files. In FIG. 3B, CSS and JS types of sub-entries of the URI entry type are excluded from the customer-facing log files. However, the JSP sub-entry type may be provided in the customer-facing log files. Certain data fields in FIG. 3B (e.g., server load) are excluded from the customer-facing log files because the data field is excluded, as previously discussed. Accordingly, a subset of entries (i.e., the sub-entries) of the event (e.g., URI) may be selected.

In block 230, the tenants may retrieve their customer-facing log files. In some implementations, after log server 105 generates the customer-facing log files (i.e., log files 315, 320, 325, and 330), the files may be stored in customer-facing log files 115, which may be a database or other storage mechanism for the log files.

Additionally, map reduce 410 may provide data to writer 420 of application server 120 with details on the customer-facing log files. Writer 420 may provide the details to BPO 425, which may create a data object by writing a row in database 430 with a pointer (e.g., a URL) to the appropriate log files stored in customer-facing log files 115. For example, BPO 425 may write in database 430 a URL or data path for each of the customer-facing log files so that the tenant may later be able to access the customer-facing log files. Additionally, the tenant ID for each of the customer-facing log files may also be stored in the rows.

In particular, the tenant may access the customer-facing log files by using API 435. For example, the tenant may use the API to contact BPO 425 to find the pointers for its customer-facing log files based on determining the pointers in a row with a corresponding tenant ID. The tenant may then be provided the customer-facing log files from customer-facing log files 115 based on using the pointers. As a result, BPO 425 (base platform object, or an abstract database object) allows a user to access, via the API, a database table with the pointers to the stored customer-facing log files. In some implementations, in addition to the pointers, users may also be provided attributes of the customer-facing log files such as length (i.e., the file size), log date, and type of log.
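The pointer table described above could be approximated with the following Python sketch, which uses an in-memory SQLite database in place of database 430. The table name, column names, and example URLs are hypothetical, and the two helper functions merely stand in for the roles of BPO 425 and API 435.

```python
import sqlite3

# In-memory database standing in for database 430 (schema is an assumption).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE log_file_pointers ("
    "tenant_id TEXT, url TEXT, log_date TEXT, log_type TEXT, length INTEGER)"
)

def record_log_file(conn, tenant_id, url, log_date, log_type, length):
    """Write one row pointing at a stored customer-facing log file."""
    conn.execute(
        "INSERT INTO log_file_pointers VALUES (?, ?, ?, ?, ?)",
        (tenant_id, url, log_date, log_type, length),
    )

def pointers_for_tenant(conn, tenant_id):
    """Return (url, log_date, log_type, length) rows for a single tenant only."""
    cursor = conn.execute(
        "SELECT url, log_date, log_type, length FROM log_file_pointers "
        "WHERE tenant_id = ? ORDER BY url",
        (tenant_id,),
    )
    return cursor.fetchall()

# Hypothetical pointers; example.com URLs are placeholders.
record_log_file(conn, "T1", "https://logs.example.com/T1/login.csv", "2013-11-01", "login", 2048)
record_log_file(conn, "T1", "https://logs.example.com/T1/download.csv", "2013-11-01", "download", 4096)
record_log_file(conn, "T2", "https://logs.example.com/T2/login.csv", "2013-11-01", "login", 1024)

for row in pointers_for_tenant(conn, "T1"):
    print(row)
```

Filtering on the tenant ID in the query is what keeps one tenant from seeing pointers to another tenant's files in this sketch.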

In some implementations, the customer-facing log files generated by map reduce 410 may be comma separated value (CSV) files with each log entry on its own line (e.g., of a text file) and with each data field separated by a comma. As a result, tenants may receive the CSV files with the log entries and data fields and use the data to develop their own applications. For example, tenants may be able to plot on a map the geographical locations where clients are downloading files from and determine whether data leakage problems exist, for example, by finding out that a file was downloaded from an unsecure location. Tenants may also use the customer-facing log files for compliance and auditing purposes. Additionally, comingled data may be split into tenant-specific data in tenant-specific customer-facing log files. As such, the customer-facing log files may be integrated into third-party applications developed by the tenants.
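The splitting of comingled entries into tenant-specific CSV files might look roughly like the sketch below. It is not the map reduce job itself; the field list, the function name split_by_tenant, and the presence of a header row are assumptions made for illustration.

```python
import csv
import io
from collections import defaultdict

# Hypothetical set of data fields written to each customer-facing file.
FIELDS = ["tenant_id", "type", "timestamp", "user"]


def split_by_tenant(entries):
    """Group comingled entries by tenant ID and render one CSV text per tenant."""
    buckets = defaultdict(list)
    for entry in entries:
        buckets[entry["tenant_id"]].append(entry)

    files = {}
    for tenant_id, rows in buckets.items():
        buf = io.StringIO()
        # extrasaction="ignore" drops any field that is not in FIELDS.
        writer = csv.DictWriter(
            buf, fieldnames=FIELDS, extrasaction="ignore", lineterminator="\n"
        )
        writer.writeheader()  # header row is an assumption, not from the source
        writer.writerows(rows)  # one log entry per line, comma-separated fields
        files[tenant_id] = buf.getvalue()
    return files


comingled = [
    {"tenant_id": "t1", "type": "Login", "timestamp": 1, "user": "ann", "server_load": 0.7},
    {"tenant_id": "t2", "type": "Login", "timestamp": 2, "user": "bob"},
    {"tenant_id": "t1", "type": "URI", "timestamp": 3, "user": "ann"},
]

files = split_by_tenant(comingled)
```

Each value in files is the text of one tenant-specific CSV file, which could then be written to storage such as customer-facing log files 115.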

In some implementations, only specific tenants may be provided with customer-facing log files. For example, tenants may pay to receive customer-facing log files, and therefore, the tenant ID data field in log entries may be analyzed to determine whether the tenant ID belongs to a tenant that pays for the service. Tenants who pay for the service may have their customer-facing log files stored in customer-facing log files 115 and access the logs through API 435. Tenants who do not pay may not have any log files in customer-facing log files 115, or may not be able to access any sort of log file in customer-facing log files 115.

In some implementations, the tenants may provide indications as to selections of fields and types of log entries that they may want to be included in their corresponding customer-facing log files. The selections may then also be used to determine the log entries and data fields to be included in the customer-facing log files. For example, the tenants may only want their customer-facing log files to include entries related to security events (e.g., logins), and therefore, map reduce 410 may further consider tenant indications when parsing the log files.
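One possible way to combine tenant indications with the administrator-approved entry types is a set intersection, sketched below. The source does not specify how the two are combined; the intersection rule, the fallback when a tenant has made no selection, and all names shown are assumptions.

```python
# Hypothetical entry types approved by the system administrator.
ADMIN_APPROVED_TYPES = {"Login", "URI"}

# Hypothetical per-tenant selections; t2 asks for a type that is not approved.
TENANT_SELECTIONS = {
    "t1": {"Login"},
    "t2": {"Login", "Indexing"},
}


def effective_types(tenant_id):
    """Entry types to include for a tenant: approved AND selected by the tenant."""
    selected = TENANT_SELECTIONS.get(tenant_id)
    if selected is None:
        # Assumption: a tenant with no selection receives every approved type.
        return set(ADMIN_APPROVED_TYPES)
    return ADMIN_APPROVED_TYPES & selected
```

Under these assumptions a tenant cannot widen what the administrator has approved; a selection can only narrow it.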

FIG. 5A shows a block diagram of an example of an environment 10 in which an on-demand database service can be used in accordance with some implementations. Environment 10 may include user systems 12, network 14, database system 16, processor system 17, application platform 18, network interface 20, tenant data storage 22, system data storage 24, program code 26, and process space 28. In other implementations, environment 10 may not have all of these components and/or may have other components instead of, or in addition to, those listed above.

Environment 10 is an environment in which an on-demand database service exists. User system 12 may be implemented as any computing device(s) or other data processing apparatus such as a machine or system that is used by a user to access a database system 16. For example, any of user systems 12 can be a handheld computing device, a mobile phone, a laptop computer, a work station, and/or a network of such computing devices. As illustrated in FIG. 5A (and in more detail in FIG. 5B) user systems 12 might interact via a network 14 with an on-demand database service, which is implemented in the example of FIG. 5A as database system 16.

An on-demand database service, implemented using system 16 by way of example, is a service that is made available to outside users, who do not need to necessarily be concerned with building and/or maintaining the database system. Instead, the database system may be available for their use when the users need the database system, i.e., on the demand of the users. Some on-demand database services may store information from one or more tenants into tables of a common database image to form a multi-tenant database system (MTS). A database image may include one or more database objects. A relational database management system (RDBMS) or the equivalent may execute storage and retrieval of information against the database object(s). Application platform 18 may be a framework that allows the applications of system 16 to run, such as the hardware and/or software, e.g., the operating system. In some implementations, application platform 18 enables creation, managing and executing one or more applications developed by the provider of the on-demand database service, users accessing the on-demand database service via user systems 12, or third party application developers accessing the on-demand database service via user systems 12.

The users of user systems 12 may differ in their respective capacities, and the capacity of a particular user system 12 might be entirely determined by permissions (permission levels) for the current user. For example, where a salesperson is using a particular user system 12 to interact with system 16, that user system has the capacities allotted to that salesperson. However, while an administrator is using that user system to interact with system 16, that user system has the capacities allotted to that administrator. In systems with a hierarchical role model, users at one permission level may have access to applications, data, and database information accessible by a lower permission level user, but may not have access to certain applications, database information, and data accessible by a user at a higher permission level. Thus, different users will have different capabilities with regard to accessing and modifying application and database information, depending on a user's security or permission level, also called authorization.
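A hierarchical role model of the kind described above can be illustrated with a small sketch in which each resource carries a minimum permission level. The role names, resource names, and numeric levels are hypothetical.

```python
# Hypothetical permission levels; higher numbers mean broader access.
PERMISSION_LEVEL = {"salesperson": 1, "manager": 2, "administrator": 3}

# Hypothetical resources, each tagged with the lowest level allowed to use it.
RESOURCE_MIN_LEVEL = {"own_leads": 1, "team_reports": 2, "org_setup": 3}


def can_access(role, resource):
    """A user may access resources at or below the user's own permission level."""
    return PERMISSION_LEVEL[role] >= RESOURCE_MIN_LEVEL[resource]


def capacities(role):
    """Capacities of a user system while a user with this role is using it."""
    return sorted(r for r in RESOURCE_MIN_LEVEL if can_access(role, r))
```

The same user system thus has different capacities depending on whether capacities("salesperson") or capacities("administrator") applies to the current user.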

Network 14 is any network or combination of networks of devices that communicate with one another. For example, network 14 can be any one or any combination of a LAN (local area network), WAN (wide area network), telephone network, wireless network, point-to-point network, star network, token ring network, hub network, or other appropriate configuration. Network 14 can include a TCP/IP (Transmission Control Protocol and Internet Protocol) network, such as the global internetwork of networks often referred to as the “Internet” with a capital “I.” The Internet will be used in many of the examples herein. However, it should be understood that the networks that the present implementations might use are not so limited, although TCP/IP is a frequently implemented protocol.

User systems 12 might communicate with system 16 using TCP/IP and, at a higher network level, use other common Internet protocols to communicate, such as HTTP, FTP, AFS, WAP, etc. In an example where HTTP is used, user system 12 might include an HTTP client commonly referred to as a “browser” for sending and receiving HTTP signals to and from an HTTP server at system 16. Such an HTTP server might be implemented as the sole network interface 20 between system 16 and network 14, but other techniques might be used as well or instead. In some implementations, the network interface 20 between system 16 and network 14 includes load sharing functionality, such as round-robin HTTP request distributors to balance loads and distribute incoming HTTP requests evenly over a plurality of servers. At least for users accessing system 16, each of the plurality of servers has access to the MTS' data; however, other alternative configurations may be used instead.

In one implementation, system 16, shown in FIG. 5A, implements a web-based customer relationship management (CRM) system. For example, in one implementation, system 16 includes application servers configured to implement and execute CRM software applications as well as provide related data, code, forms, web pages and other information to and from user systems 12 and to store to, and retrieve from, a database system related data, objects, and web page content. With a multi-tenant system, data for multiple tenants may be stored in the same physical database object in tenant data storage 22; however, tenant data typically is arranged in the storage medium(s) of tenant data storage 22 so that data of one tenant is kept logically separate from that of other tenants so that one tenant does not have access to another tenant's data, unless such data is expressly shared. In certain implementations, system 16 implements applications other than, or in addition to, a CRM application. For example, system 16 may provide tenant access to multiple hosted (standard and custom) applications, including a CRM application. User (or third party developer) applications, which may or may not include CRM, may be supported by the application platform 18, which manages creation, storage of the applications into one or more database objects and executing of the applications in a virtual machine in the process space of the system 16.

One arrangement for elements of system 16 is shown in FIGS. 5A and 5B, including a network interface 20, application platform 18, tenant data storage 22 for tenant data 23, system data storage 24 for system data 25 accessible to system 16 and possibly multiple tenants, program code 26 for implementing various functions of system 16, and a process space 28 for executing MTS system processes and tenant-specific processes, such as running applications as part of an application hosting service. Additional processes that may execute on system 16 include database indexing processes.

Several elements in the system shown in FIG. 5A include conventional, well-known elements that are explained only briefly here. For example, each user system 12 could include a desktop personal computer, workstation, laptop, PDA, tablet, smartphone, or any wireless access protocol (WAP) enabled device or any other computing device capable of interfacing directly or indirectly to the Internet or other network connection. The term “computing device” is also referred to herein simply as a “computer”. User system 12 typically runs an HTTP client, e.g., a browsing program, such as Microsoft's Internet Explorer browser, Netscape's Navigator browser, Opera's browser, or a WAP-enabled browser in the case of a cell phone, PDA or other wireless device, or the like, allowing a user (e.g., subscriber of the multi-tenant database system) of user system 12 to access, process and view information, pages and applications available to it from system 16 over network 14. Each user system 12 also typically includes one or more user input devices, such as a keyboard, a mouse, trackball, touch pad, touch screen, pen or the like, for interacting with a graphical user interface (GUI) provided by the browser on a display (e.g., a monitor screen, LCD display, etc.) of the computing device in conjunction with pages, forms, applications and other information provided by system 16 or other systems or servers. For example, the user interface device can be used to access data and applications hosted by system 16, and to perform searches on stored data, and otherwise allow a user to interact with various GUI pages that may be presented to a user. As discussed above, implementations are suitable for use with the Internet, although other networks can be used instead of or in addition to the Internet, such as an intranet, an extranet, a virtual private network (VPN), a non-TCP/IP based network, any LAN or WAN or the like.

According to one implementation, each user system 12 and all of its components are operator configurable using applications, such as a browser, including computer code run using a central processing unit such as an Intel Pentium® processor or the like. Similarly, system 16 (and additional instances of an MTS, where more than one is present) and all of its components might be operator configurable using application(s) including computer code to run using processor system 17, which may be implemented to include a central processing unit, which may include an Intel Pentium® processor or the like, and/or multiple processor units. Non-transitory computer-readable media can have instructions stored thereon/in, that can be executed by or used to program a computing device to perform any of the methods of the implementations described herein. Computer program code 26 implementing instructions for operating and configuring system 16 to intercommunicate and to process web pages, applications and other data and media content as described herein is preferably downloadable and stored on a hard disk, but the entire program code, or portions thereof, may also be stored in any other volatile or non-volatile memory medium or device as is well known, such as a ROM or RAM, or provided on any media capable of storing program code, such as any type of rotating media including floppy disks, optical discs, digital versatile disk (DVD), compact disk (CD), microdrive, and magneto-optical disks, and magnetic or optical cards, nanosystems (including molecular memory ICs), or any other type of computer-readable medium or device suitable for storing instructions and/or data. Additionally, the entire program code, or portions thereof, may be transmitted and downloaded from a software source over a transmission medium, e.g., over the Internet, or from another server, as is well known, or transmitted over any other conventional network connection as is well known (e.g., extranet, VPN, LAN, etc.) using any communication medium and protocols (e.g., TCP/IP, HTTP, HTTPS, Ethernet, etc.) as are well known. It will also be appreciated that computer code for the disclosed implementations can be realized in any programming language that can be executed on a client system and/or server or server system such as, for example, C, C++, HTML, any other markup language, Java™, JavaScript, ActiveX, any other scripting language, such as VBScript, and many other programming languages as are well known may be used. (Java™ is a trademark of Sun Microsystems, Inc.).

According to some implementations, each system 16 is configured to provide web pages, forms, applications, data and media content to user (client) systems 12 to support the access by user systems 12 as tenants of system 16. As such, system 16 provides security mechanisms to keep each tenant's data separate unless the data is shared. If more than one MTS is used, they may be located in close proximity to one another (e.g., in a server farm located in a single building or campus), or they may be distributed at locations remote from one another (e.g., one or more servers located in city A and one or more servers located in city B). As used herein, each MTS could include one or more logically and/or physically connected servers distributed locally or across one or more geographic locations. Additionally, the term “server” is meant to refer to a computing device or system, including processing hardware and process space(s), an associated storage medium such as a memory device or database, and, in some instances, a database application (e.g., OODBMS or RDBMS) as is well known in the art. It should also be understood that “server system” and “server” are often used interchangeably herein. Similarly, the database objects described herein can be implemented as single databases, a distributed database, a collection of distributed databases, a database with redundant online or offline backups or other redundancies, etc., and might include a distributed database or storage network and associated processing intelligence.

FIG. 5B shows a block diagram of an example of some implementations of elements of FIG. 5A and various possible interconnections between these elements. That is, FIG. 5B also illustrates environment 10. However, in FIG. 5B elements of system 16 and various interconnections in some implementations are further illustrated. FIG. 5B shows that user system 12 may include processor system 12A, memory system 12B, input system 12C, and output system 12D. FIG. 5B shows network 14 and system 16. FIG. 5B also shows that system 16 may include tenant data storage 22, tenant data 23, system data storage 24, system data 25, User Interface (UI) 30, Application Program Interface (API) 32, PL/SOQL 34, save routines 36, application setup mechanism 38, application servers 50_1-50_N, system process space 52, tenant process spaces 54, tenant management process space 60, tenant storage space 62, user storage 64, and application metadata 66. In other implementations, environment 10 may not have the same elements as those listed above and/or may have other elements instead of, or in addition to, those listed above.

User system 12, network 14, system 16, tenant data storage 22, and system data storage 24 were discussed above in FIG. 5A. Regarding user system 12, processor system 12A may be any combination of one or more processors. Memory system 12B may be any combination of one or more memory devices, short term, and/or long term memory. Input system 12C may be any combination of input devices, such as one or more keyboards, mice, trackballs, scanners, cameras, and/or interfaces to networks. Output system 12D may be any combination of output devices, such as one or more monitors, printers, and/or interfaces to networks. As shown by FIG. 5B, system 16 may include a network interface 20 (of FIG. 5A) implemented as a set of HTTP application servers 50, an application platform 18, tenant data storage 22, and system data storage 24. Also shown is system process space 52, including individual tenant process spaces 54 and a tenant management process space 60. Each application server 50 may be configured to communicate with tenant data storage 22 and the tenant data 23 therein, and system data storage 24 and the system data 25 therein to serve requests of user systems 12. The tenant data 23 might be divided into individual tenant storage spaces 62, which can be either a physical arrangement and/or a logical arrangement of data. Within each tenant storage space 62, user storage 64 and application metadata 66 might be similarly allocated for each user. For example, a copy of a user's most recently used (MRU) items might be stored to user storage 64. Similarly, a copy of MRU items for an entire organization that is a tenant might be stored to tenant storage space 62. A UI 30 provides a user interface and an API 32 provides an application programmer interface to system 16 resident processes to users and/or developers at user systems 12. The tenant data and the system data may be stored in various databases, such as one or more Oracle databases.

Application platform 18 includes an application setup mechanism 38 that supports application developers' creation and management of applications, which may be saved as metadata into tenant data storage 22 by save routines 36 for execution by subscribers as one or more tenant process spaces 54 managed by tenant management process 60 for example. Invocations to such applications may be coded using PL/SOQL 34 that provides a programming language style interface extension to API 32. A detailed description of some PL/SOQL language implementations is discussed in commonly assigned U.S. Pat. No. 7,730,478, titled METHOD AND SYSTEM FOR ALLOWING ACCESS TO DEVELOPED APPLICATIONS VIA A MULTI-TENANT ON-DEMAND DATABASE SERVICE, by Craig Weissman, issued on Jun. 1, 2010, and hereby incorporated by reference in its entirety and for all purposes. Invocations to applications may be detected by one or more system processes, which manage retrieving application metadata 66 for the subscriber making the invocation and executing the metadata as an application in a virtual machine.

Each application server 50 may be communicably coupled to database systems, e.g., having access to system data 25 and tenant data 23, via a different network connection. For example, one application server 50_1 might be coupled via the network 14 (e.g., the Internet), another application server 50_(N-1) might be coupled via a direct network link, and another application server 50_N might be coupled by yet a different network connection. Transmission Control Protocol and Internet Protocol (TCP/IP) are typical protocols for communicating between application servers 50 and the database system. However, it will be apparent to one skilled in the art that other transport protocols may be used to optimize the system depending on the network interconnect used.

In certain implementations, each application server 50 is configured to handle requests for any user associated with any organization that is a tenant. Because it is desirable to be able to add and remove application servers from the server pool at any time for any reason, there is preferably no server affinity for a user and/or organization to a specific application server 50. In one implementation, therefore, an interface system implementing a load balancing function (e.g., an F5 Big-IP load balancer) is communicably coupled between the application servers 50 and the user systems 12 to distribute requests to the application servers 50. In one implementation, the load balancer uses a least connections algorithm to route user requests to the application servers 50. Other examples of load balancing algorithms, such as round robin and observed response time, also can be used. For example, in certain implementations, three consecutive requests from the same user could hit three different application servers 50, and three requests from different users could hit the same application server 50. In this manner, by way of example, system 16 is multi-tenant, wherein system 16 handles storage of, and access to, different objects, data and applications across disparate users and organizations.
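The least connections routing mentioned above can be sketched in a few lines. This is a simplified illustration and does not represent the behavior of any particular commercial load balancer; the class name, method names, and server names are hypothetical.

```python
class LeastConnectionsBalancer:
    """Route each request to the server with the fewest active connections."""

    def __init__(self, servers):
        # Active connection count per application server.
        self.active = {server: 0 for server in servers}

    def add_server(self, server):
        """Servers may join the pool at any time; no user affinity is kept."""
        self.active.setdefault(server, 0)

    def remove_server(self, server):
        """Servers may leave the pool at any time."""
        self.active.pop(server, None)

    def acquire(self):
        """Pick the least-loaded server (ties go to the earliest added)."""
        server = min(self.active, key=self.active.get)
        self.active[server] += 1
        return server

    def release(self, server):
        """Mark one connection on the given server as finished."""
        if self.active.get(server, 0) > 0:
            self.active[server] -= 1


balancer = LeastConnectionsBalancer(["app1", "app2", "app3"])
# Three consecutive requests from the same user, none yet finished.
first_three = [balancer.acquire() for _ in range(3)]
```

With no connections released in between, the three consecutive requests land on three different application servers, consistent with the absence of server affinity described above.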

As an example of storage, one tenant might be a company that employs a sales force where each salesperson uses system 16 to manage their sales process. Thus, a user might maintain contact data, leads data, customer follow-up data, performance data, goals and progress data, etc., all applicable to that user's personal sales process (e.g., in tenant data storage 22). In an example of an MTS arrangement, since all of the data and the applications to access, view, modify, report, transmit, calculate, etc., can be maintained and accessed by a user system having nothing more than network access, the user can manage his or her sales efforts and cycles from any of many different user systems. For example, if a salesperson is visiting a customer and the customer has Internet access in their lobby, the salesperson can obtain critical updates as to that customer while waiting for the customer to arrive in the lobby.

While each user's data might be separate from other users' data regardless of the employers of each user, some data might be organization-wide data shared or accessible by a plurality of users or all of the users for a given organization that is a tenant. Thus, there might be some data structures managed by system 16 that are allocated at the tenant level while other data structures might be managed at the user level. Because an MTS might support multiple tenants including possible competitors, the MTS should have security protocols that keep data, applications, and application use separate. Also, because many tenants may opt for access to an MTS rather than maintain their own system, redundancy, up-time, and backup are additional functions that may be implemented in the MTS. In addition to user-specific data and tenant-specific data, system 16 might also maintain system level data usable by multiple tenants or other data. Such system level data might include industry reports, news, postings, and the like that are sharable among tenants.

In certain implementations, user systems 12 (which may be client systems) communicate with application servers 50 to request and update system-level and tenant-level data from system 16 that may involve sending one or more queries to tenant data storage 22 and/or system data storage 24. System 16 (e.g., an application server 50 in system 16) automatically generates one or more SQL statements (e.g., one or more SQL queries) that are designed to access the desired information. System data storage 24 may generate query plans to access the requested data from the database.

Each database can generally be viewed as a collection of objects, such as a set of logical tables, containing data fitted into predefined categories. A “table” is one representation of a data object, and may be used herein to simplify the conceptual description of objects and custom objects according to some implementations. It should be understood that “table” and “object” may be used interchangeably herein. Each table generally contains one or more data categories logically arranged as columns or fields in a viewable schema. Each row or record of a table contains an instance of data for each category defined by the fields. For example, a CRM database may include a table that describes a customer with fields for basic contact information such as name, address, phone number, fax number, etc. Another table might describe a purchase order, including fields for information such as customer, product, sale price, date, etc. In some multi-tenant database systems, standard entity tables might be provided for use by all tenants. For CRM database applications, such standard entities might include tables for case, account, contact, lead, and opportunity data objects, each containing pre-defined fields. It should be understood that the word “entity” may also be used interchangeably herein with “object” and “table”.

In some multi-tenant database systems, tenants may be allowed to create and store custom objects, or they may be allowed to customize standard entities or objects, for example by creating custom fields for standard objects, including custom index fields. Commonly assigned U.S. Pat. No. 7,779,039, titled CUSTOM ENTITIES AND FIELDS IN A MULTI-TENANT DATABASE SYSTEM, by Weissman et al., issued on Aug. 17, 2010, and hereby incorporated by reference in its entirety and for all purposes, teaches systems and methods for creating custom objects as well as customizing standard objects in a multi-tenant database system. In certain implementations, for example, all custom entity data rows are stored in a single multi-tenant physical table, which may contain multiple logical tables per organization. It is transparent to customers that their multiple “tables” are in fact stored in one large table or that their data may be stored in the same table as the data of other customers.
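The idea of storing custom entity rows for many organizations in a single physical table, while presenting each organization with its own logical tables, can be illustrated as follows. The table layout, the generic value columns, and the sample data are hypothetical and are not drawn from the referenced patent.

```python
import sqlite3

# One physical table shared by all organizations (in-memory for illustration).
db = sqlite3.connect(":memory:")
db.execute(
    "CREATE TABLE custom_entity_data ("
    " org_id TEXT, logical_table TEXT, row_id INTEGER, val0 TEXT, val1 TEXT)"
)
db.executemany(
    "INSERT INTO custom_entity_data VALUES (?, ?, ?, ?, ?)",
    [
        ("org1", "Invoice", 1, "INV-001", "120.00"),
        ("org1", "Shipment", 1, "SHP-9", "in transit"),
        ("org2", "Invoice", 1, "A-77", "15.50"),
    ],
)


def select_logical_table(org_id, logical_table):
    """Return only the rows of one organization's logical table."""
    cur = db.execute(
        "SELECT row_id, val0, val1 FROM custom_entity_data"
        " WHERE org_id = ? AND logical_table = ? ORDER BY row_id",
        (org_id, logical_table),
    )
    return cur.fetchall()
```

Because every query is scoped by organization and logical table, org1 and org2 each see what appears to be a private Invoice table even though the rows share one physical table.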

FIG. 6A shows a system diagram illustrating an example of architectural components of an on-demand database service environment 1200 according to some implementations. A client machine located in the cloud 1204, generally referring to one or more networks in combination, as described herein, may communicate with the on-demand database service environment via one or more edge routers 1208 and 1212. A client machine can be any of the examples of user systems 12 described above. The edge routers may communicate with one or more core switches 1220 and 1224 via firewall 1216. The core switches may communicate with a load balancer 1228, which may distribute server load over different pods, such as the pods 1240 and 1244. The pods 1240 and 1244, which may each include one or more servers and/or other computing resources, may perform data processing and other operations used to provide on-demand services. Communication with the pods may be conducted via pod switches 1232 and 1236. Components of the on-demand database service environment may communicate with a database storage 1256 via a database firewall 1248 and a database switch 1252.

As shown in FIGS. 6A and 6B, accessing an on-demand database service environment may involve communications transmitted among a variety of different hardware and/or software components. Further, the on-demand database service environment 1200 is a simplified representation of an actual on-demand database service environment. For example, while only one or two devices of each type are shown in FIGS. 6A and 6B, some implementations of an on-demand database service environment may include anywhere from one to many devices of each type. Also, the on-demand database service environment need not include each device shown in FIGS. 6A and 6B, or may include additional devices not shown in FIGS. 6A and 6B.

Moreover, one or more of the devices in the on-demand database service environment 1200 may be implemented on the same physical device or on different hardware. Some devices may be implemented using hardware or a combination of hardware and software. Thus, terms such as “data processing apparatus,” “machine,” “server” and “device” as used herein are not limited to a single hardware device, but rather include any hardware and software configured to provide the described functionality.

The cloud 1204 is intended to refer to a data network or plurality of data networks, often including the Internet. Client machines located in the cloud 1204 may communicate with the on-demand database service environment to access services provided by the on-demand database service environment. For example, client machines may access the on-demand database service environment to retrieve, store, edit, and/or process information.

In some implementations, the edge routers 1208 and 1212 route packets between the cloud 1204 and other components of the on-demand database service environment 1200. The edge routers 1208 and 1212 may employ the Border Gateway Protocol (BGP). The BGP is the core routing protocol of the Internet. The edge routers 1208 and 1212 may maintain a table of IP networks or ‘prefixes’, which designate network reachability among autonomous systems on the Internet.

In one or more implementations, the firewall 1216 may protect the inner components of the on-demand database service environment 1200 from Internet traffic. The firewall 1216 may block, permit, or deny access to the inner components of the on-demand database service environment 1200 based upon a set of rules and other criteria. The firewall 1216 may act as one or more of a packet filter, an application gateway, a stateful filter, a proxy server, or any other type of firewall.

In some implementations, the core switches 1220 and 1224 are high-capacity switches that transfer packets within the on-demand database service environment 1200. The core switches 1220 and 1224 may be configured as network bridges that quickly route data between different components within the on-demand database service environment. In some implementations, the use of two or more core switches 1220 and 1224 may provide redundancy and/or reduced latency.

In some implementations, the pods 1240 and 1244 may perform the core data processing and service functions provided by the on-demand database service environment. Each pod may include various types of hardware and/or software computing resources. An example of the pod architecture is discussed in greater detail with reference to FIG. 6B.

In some implementations, communication between the pods 1240 and 1244 may be conducted via the pod switches 1232 and 1236. The pod switches 1232 and 1236 may facilitate communication between the pods 1240 and 1244 and client machines located in the cloud 1204, for example via core switches 1220 and 1224. Also, the pod switches 1232 and 1236 may facilitate communication between the pods 1240 and 1244 and the database storage 1256.

In some implementations, the load balancer 1228 may distribute workload between the pods 1240 and 1244. Balancing the on-demand service requests between the pods may assist in improving the use of resources, increasing throughput, reducing response times, and/or reducing overhead. The load balancer 1228 may include multilayer switches to analyze and forward traffic.

In some implementations, access to the database storage 1256 may be guarded by a database firewall 1248. The database firewall 1248 may act as a computer application firewall operating at the database application layer of a protocol stack. The database firewall 1248 may protect the database storage 1256 from application attacks such as structured query language (SQL) injection, database rootkits, and unauthorized information disclosure.

In some implementations, the database firewall 1248 may include a host using one or more forms of reverse proxy services to proxy traffic before passing it to a gateway router. The database firewall 1248 may inspect the contents of database traffic and block certain content or database requests. The database firewall 1248 may work on the SQL application level atop the TCP/IP stack, managing applications' connection to the database or SQL management interfaces as well as intercepting and enforcing packets traveling to or from a database network or application interface.
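As a simplified, non-limiting sketch, inspecting database traffic and blocking certain requests might resemble the following. The deny-list patterns and the function name inspect_query are hypothetical and chosen for illustration. Pattern matching of this kind is a crude stand-in for the inspection a production firewall would perform.

```python
import re

# Hypothetical deny-list of patterns often associated with SQL injection.
# Illustrative only; not an exhaustive or recommended rule set.
BLOCKED_PATTERNS = [
    re.compile(r";\s*drop\s+table", re.IGNORECASE),
    re.compile(r"\bunion\s+select\b", re.IGNORECASE),
    re.compile(r"\bor\s+1\s*=\s*1\b", re.IGNORECASE),
]


def inspect_query(sql):
    """Return True if the query may pass, False if it matches a blocked pattern."""
    return not any(pattern.search(sql) for pattern in BLOCKED_PATTERNS)


allowed = inspect_query("SELECT name FROM accounts WHERE id = 7")
blocked = inspect_query("SELECT * FROM users WHERE name = '' OR 1=1")
```

In this sketch, a query that fails inspection would simply not be forwarded to the database storage.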

In some implementations, communication with the database storage 1256 may be conducted via the database switch 1252. The multi-tenant database storage 1256 may include more than one hardware and/or software components for handling database queries. Accordingly, the database switch 1252 may direct database queries transmitted by other components of the on-demand database service environment (e.g., the pods 1240 and 1244) to the correct components within the database storage 1256.

In some implementations, the database storage 1256 is an on-demand database system shared by many different organizations. The on-demand database system may employ a multi-tenant approach, a virtualized approach, or any other type of database approach. An on-demand database system is discussed in greater detail with reference to FIGS. 5A and 5B.

FIG. 6B shows a system diagram further illustrating an example of architectural components of an on-demand database service environment according to some implementations. The pod 1244 may be used to render services to a user of the on-demand database service environment 1200. In some implementations, each pod may include a variety of servers and/or other systems. The pod 1244 includes one or more content batch servers 1264, content search servers 1268, query servers 1282, file servers 1286, access control system (ACS) servers 1280, batch servers 1284, and app servers 1288. Also, the pod 1244 includes database instances 1290, quick file systems (QFS) 1292, and indexers 1294. In one or more implementations, some or all communication between the servers in the pod 1244 may be transmitted via the pod switch 1236.

In some implementations, the app servers 1288 may include a hardware and/or software framework dedicated to the execution of procedures (e.g., programs, routines, scripts) for supporting the construction of applications provided by the on-demand database service environment 1200 via the pod 1244. In some implementations, the hardware and/or software framework of an app server 1288 is configured to execute operations of the services described herein, including performance of the blocks of methods described with reference to FIGS. 1-4. In alternative implementations, two or more app servers 1288 may be included and cooperate to perform such methods, or one or more other servers described herein can be configured to perform the disclosed methods.
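For illustration only, the log processing that such an app server might perform (selecting approved entry types, selecting approved data fields for each type, and grouping the results by a client identifier) can be sketched as follows. The field names, the entry types, and the APPROVED mapping are hypothetical; in some implementations the approval data may instead be supplied by a separate data file.

```python
from collections import defaultdict

# Hypothetical approval data: entry type -> approved data fields.
APPROVED = {
    "login": ["timestamp", "user_id"],
    "api_call": ["timestamp", "user_id", "endpoint"],
}


def build_customer_logs(entries, approved=APPROVED, client_field="client_id"):
    """Return a mapping of tenant identifier to that tenant's trimmed entries."""
    logs = defaultdict(list)
    for entry in entries:
        fields = approved.get(entry.get("type"))
        if fields is None:
            continue  # entry type is not approved; drop the whole entry
        tenant = entry.get(client_field)
        if tenant is None:
            continue  # cannot attribute the entry to a tenant
        # Keep only the approved data fields for this entry type.
        trimmed = {name: entry[name] for name in fields if name in entry}
        logs[tenant].append(trimmed)
    return dict(logs)


entries = [
    {"type": "login", "client_id": "org-a", "timestamp": "t1",
     "user_id": "u1", "internal_host": "h9"},
    {"type": "gc_pause", "client_id": "org-a", "timestamp": "t2"},
    {"type": "api_call", "client_id": "org-b", "timestamp": "t3",
     "user_id": "u2", "endpoint": "/q", "internal_host": "h3"},
]
customer_logs = build_customer_logs(entries)
```

In this sketch the unapproved "gc_pause" entry and the unapproved "internal_host" field do not appear in any tenant's output, and each tenant receives only its own entries.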

The content batch servers 1264 may handle requests internal to the pod. These requests may be long-running and/or not tied to a particular customer. For example, the content batch servers 1264 may handle requests related to log mining, cleanup work, and maintenance tasks.

The content search servers 1268 may provide query and indexer functions. For example, the functions provided by the content search servers 1268 may allow users to search through content stored in the on-demand database service environment.

The file servers 1286 may manage requests for information stored in the file storage 1298. The file storage 1298 may store information such as documents, images, and binary large objects (BLOBs). By managing requests for information using the file servers 1286, the image footprint on the database may be reduced.

The query servers 1282 may be used to retrieve information from one or more file systems. For example, the query servers 1282 may receive requests for information from the app servers 1288 and then transmit information queries to the NFS 1296 located outside the pod.

The pod 1244 may share a database instance 1290 configured as a multi-tenant environment in which different organizations share access to the same database. Additionally, services rendered by the pod 1244 may call upon various hardware and/or software resources. In some implementations, the ACS servers 1280 may control access to data, hardware resources, or software resources.

In some implementations, the batch servers 1284 may process batch jobs, which are used to run tasks at specified times. Thus, the batch servers 1284 may transmit instructions to other servers, such as the app servers 1288, to trigger the batch jobs.
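As one possible sketch, running tasks at specified times could be modeled with a priority queue ordered by run time. The class name BatchScheduler, the job names, and the use of plain integers as timestamps are assumptions made for this example.

```python
import heapq


class BatchScheduler:
    """Hypothetical scheduler that releases jobs once their run time arrives."""

    def __init__(self):
        self._queue = []
        self._seq = 0  # tie-breaker so equal run times keep insertion order

    def schedule(self, run_at, job_name):
        heapq.heappush(self._queue, (run_at, self._seq, job_name))
        self._seq += 1

    def due_jobs(self, now):
        """Pop and return the names of all jobs whose run time is <= now."""
        due = []
        while self._queue and self._queue[0][0] <= now:
            _, _, name = heapq.heappop(self._queue)
            due.append(name)
        return due


scheduler = BatchScheduler()
scheduler.schedule(30, "cleanup")
scheduler.schedule(10, "report")
scheduler.schedule(20, "export")
first_batch = scheduler.due_jobs(now=20)
```

A batch server following this sketch would pass each released job name to another server, such as an app server, as the instruction that triggers the job.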

In some implementations, the QFS 1292 may be an open source file system available from Sun Microsystems® of Santa Clara, Calif. The QFS 1292 may serve as a rapid-access file system for storing and accessing information available within the pod 1244. The QFS 1292 may support some volume management capabilities, allowing many disks to be grouped together into a file system. File system metadata can be kept on a separate set of disks, which may be useful for streaming applications where long disk seeks cannot be tolerated. Thus, the QFS 1292 may communicate with one or more content search servers 1268 and/or indexers 1294 to identify, retrieve, move, and/or update data stored in the network file systems 1296 and/or other storage systems.

In some implementations, one or more query servers 1282 may communicate with the NFS 1296 to retrieve and/or update information stored outside of the pod 1244. The NFS 1296 may allow servers located in the pod 1244 to access files over a network in a manner similar to how local storage is accessed.

In some implementations, queries from the query servers 1282 may be transmitted to the NFS 1296 via the load balancer 1228, which may distribute resource requests over various resources available in the on-demand database service environment. The NFS 1296 may also communicate with the QFS 1292 to update the information stored on the NFS 1296 and/or to provide information to the QFS 1292 for use by servers located within the pod 1244.

In some implementations, the pod may include one or more database instances 1290. The database instance 1290 may transmit information to the QFS 1292. When information is transmitted to the QFS, it may be available for use by servers within the pod 1244 without using an additional database call.
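The effect of making transmitted information available without an additional database call may be sketched, purely as an example, with a local store that is consulted before the database. The class name QuickStore, the dictionary standing in for the database, and the call counter are hypothetical.

```python
class QuickStore:
    """Hypothetical pod-local store that avoids repeat database calls."""

    def __init__(self, database):
        self._database = database  # stand-in for a database instance
        self._local = {}
        self.database_calls = 0    # counts reads that fall through to the database

    def publish(self, key, value):
        # Called when the database instance transmits information to the store.
        self._local[key] = value

    def read(self, key):
        if key in self._local:
            return self._local[key]  # served locally, no database call
        self.database_calls += 1
        return self._database[key]


database = {"account:1": "Acme", "account:2": "Globex"}
store = QuickStore(database)
store.publish("account:1", "Acme")
local_value = store.read("account:1")
calls_after_local_read = store.database_calls
```

Reading a key that was never published falls through to the database and increments the counter, which makes the saved call visible.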

In some implementations, database information may be transmitted to the indexer 1294. Indexer 1294 may provide an index of information available in the database 1290 and/or QFS 1292. The index information may be provided to file servers 1286 and/or the QFS 1292.
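For purposes of illustration, an index of this kind might take the form of a simple inverted index that maps each word to the records containing it. The function name build_index and the sample records are assumptions; the disclosure does not prescribe an index structure.

```python
from collections import defaultdict


def build_index(documents):
    """Map each lowercase word to the sorted IDs of the documents containing it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in text.lower().split():
            index[word].add(doc_id)
    return {word: sorted(ids) for word, ids in index.items()}


# Hypothetical records keyed by identifier.
documents = {1: "Invoice price updated", 2: "price list"}
index = build_index(documents)
```

A lookup such as index["price"] then identifies the matching records without scanning every stored document.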

As multiple users might be able to change the data of a record, it can be useful for certain users to be notified when a record is updated. Also, even if a user does not have authority to change a record, the user still might want to know when there is an update to the record. For example, a vendor may negotiate a new price with a salesperson of company X, where the salesperson is a user associated with tenant Y. As part of creating a new invoice or for accounting purposes, the salesperson can change the price saved in the database. It may be important for co-workers to know that the price has changed. The salesperson could send an email to certain people, but this is onerous and the salesperson might not email all of the people who need to know or want to know. Accordingly, some implementations of the disclosed techniques can inform others (e.g., co-workers) who want to know about an update to a record automatically.
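A minimal sketch of such automatic notification, assuming a simple follower model, is given below. The class name RecordTracker, the outbox list standing in for message delivery, and the user and record names are hypothetical.

```python
from collections import defaultdict


class RecordTracker:
    """Hypothetical tracker that notifies followers when a record changes."""

    def __init__(self):
        self._followers = defaultdict(set)
        self.outbox = []  # (user, message) pairs standing in for delivery

    def follow(self, user, record_id):
        self._followers[record_id].add(user)

    def update(self, record_id, field, new_value, changed_by):
        message = f"{changed_by} set {field} of {record_id} to {new_value}"
        # Notify every follower except the user who made the change.
        for user in sorted(self._followers.get(record_id, ())):
            if user != changed_by:
                self.outbox.append((user, message))


tracker = RecordTracker()
tracker.follow("alice", "invoice-1")
tracker.follow("bob", "invoice-1")
tracker.follow("salesperson", "invoice-1")
tracker.update("invoice-1", "price", 95, changed_by="salesperson")
```

Under this sketch the salesperson changes the price once, and each co-worker who follows the record receives a message without the salesperson sending any email.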

The tracking and reporting of updates to a record stored in a database system can be facilitated with a multi-tenant database system 16, e.g., by one or more processors configured to receive or retrieve information, process the information, store results, and transmit the results. In other implementations, the tracking and reporting of updates to a record may be implemented at least partially with a single tenant database system.

The specific details of the specific aspects of implementations disclosed herein may be combined in any suitable manner without departing from the spirit and scope of the disclosed implementations. However, other implementations may be directed to specific implementations relating to each individual aspect, or specific combinations of these individual aspects.

While the disclosed examples are often described herein with reference to an implementation in which an on-demand database service environment is implemented in a system having an application server providing a front end for an on-demand database service capable of supporting multiple tenants, the present implementations are not limited to multi-tenant databases or to deployment on application servers. Implementations may be practiced using other database architectures, e.g., ORACLE®, DB2® by IBM and the like without departing from the scope of the implementations claimed.

It should be understood that some of the disclosed implementations can be embodied in the form of control logic using hardware and/or using computer software in a modular or integrated manner. Other ways and/or methods are possible using hardware or a combination of hardware and software.

Any of the software components or functions described in this application may be implemented as software code to be executed by a processor using any suitable computer language such as, for example, Java, C++ or Perl using, for example, conventional or object-oriented techniques. The software code may be stored as a series of instructions or commands on a computer-readable medium for storage and/or transmission. Suitable media include random access memory (RAM), read-only memory (ROM), a magnetic medium such as a hard drive or a floppy disk, an optical medium such as a compact disk (CD) or digital versatile disk (DVD), flash memory, and the like. The computer-readable medium may be any combination of such storage or transmission devices. Computer-readable media encoded with the software/program code may be packaged with a compatible device or provided separately from other devices (e.g., via Internet download). Any such computer-readable medium may reside on or within a single computing device or an entire computer system, and may be among other computer-readable media within a system or network. A computer system, or other computing device, may include a monitor, printer, or other suitable display for providing any of the results mentioned herein to a user.

While various implementations have been described herein, it should be understood that they have been presented by way of example only, and not limitation.

Thus, the breadth and scope of the present application should not be limited by any of the implementations described herein, but should be defined only in accordance with the following and later-submitted claims and their equivalents. 

What is claimed is:
 1. A database system comprising: at least one server having at least one processor capable of executing instructions configured to cause: selecting a subset of entries of a log file based on data indicating approved entry types, the entries having data fields, the entries corresponding to user events associated with a plurality of users of tenants in a multi-tenant database system; selecting a subset of the data fields of the subset of the entries based on data indicating approved data fields of a corresponding entry type; and generating a plurality of customer-facing log files based on a client identifier data field in each entry in the subset of the entries, each of the customer-facing log files being associated with a corresponding tenant of the multi-tenant database system, and each of the customer-facing log files comprising the subset of the data fields of the subset of the entries associated with the corresponding tenant.
 2. The database system of claim 1, the instructions further configured to cause: capturing a series of user events as the entries in the log file.
 3. The database system of claim 1, wherein the data indicating approved entry types and the data indicating approved data fields are provided by a data file comprising attributions of entry types and data fields indicating the approved entry types and approved data fields.
 4. The database system of claim 3, wherein the data file is an extensible markup language (XML) file.
 5. The database system of claim 1, the instructions further configured to cause: storing pointers associated with the plurality of customer-facing log files as at least one data object in a database, wherein each of the pointers indicates a location of a corresponding customer-facing log file.
 6. The database system of claim 5, wherein the tenants are capable of accessing the customer-facing log files using an application programming interface (API) providing access to the pointers.
 7. A computer implemented method for processing a log file, the method comprising: receiving, by a server associated with a multi-tenant database system, a log file comprising entries having data fields, the entries corresponding to user events associated with a plurality of users of tenants in the multi-tenant database system; selecting, by the server, a subset of the entries of the log file based on data indicating approved entry types; selecting, by the server, a subset of the data fields of the subset of the entries based on data indicating approved data fields of a corresponding entry type; and generating, by the server, a plurality of customer-facing log files based on a client identifier data field in each entry in the subset of the entries, each of the customer-facing log files being associated with a corresponding tenant of the multi-tenant database system, and each of the customer-facing log files comprising the subset of the data fields of the subset of the entries associated with the corresponding tenant.
 8. The computer implemented method of claim 7, further comprising: capturing a series of user events as the entries in the log file.
 9. The computer implemented method of claim 7, wherein the data indicating approved entry types and the data indicating approved data fields are provided by a data file comprising attributions of entry types and data fields indicating the approved entry types and approved data fields.
 10. The computer implemented method of claim 9, wherein the data file is an extensible markup language (XML) file.
 11. The computer implemented method of claim 7, further comprising: storing pointers associated with the plurality of customer-facing log files as at least one data object in a database, wherein each of the pointers indicates a location of a corresponding customer-facing log file.
 12. The computer implemented method of claim 11, wherein the tenants are capable of accessing the customer-facing log files using an application programming interface (API) providing access to the pointers.
 13. The computer implemented method of claim 7, wherein selecting the subset of the entries of the log file based on data indicating approved entry types further includes selecting sub-entries of the entries based on data indicating approved sub-entry types.
 14. A system comprising: database system software stored on a non-transitory data storage medium for execution by at least one server of a database system, the database system software operable to cause: selecting a subset of entries of a log file based on data indicating approved entry types, the entries having data fields, the entries corresponding to user events associated with a plurality of users of tenants in a multi-tenant database system; selecting a subset of the data fields of the subset of the entries based on data indicating approved data fields of a corresponding entry type; and generating a plurality of customer-facing log files based on a client identifier data field in each entry in the subset of the entries, each of the customer-facing log files being associated with a corresponding tenant of the multi-tenant database system, and each of the customer-facing log files comprising the subset of the data fields of the subset of the entries associated with the corresponding tenant.
 15. The system of claim 14, the database system software further operable to cause: capturing a series of user events as the entries in the log file.
 16. The system of claim 14, wherein the data indicating approved entry types and the data indicating approved data fields are provided by a data file comprising attributions of entry types and data fields indicating the approved entry types and approved data fields.
 17. The system of claim 16, wherein the data file is an extensible markup language (XML) file.
 18. The system of claim 14, the database system software further operable to cause: storing pointers associated with the plurality of customer-facing log files as at least one data object in a database, wherein each of the pointers indicates a location of a corresponding customer-facing log file.
 19. A computer program product comprising program code to be executed by at least one processor when retrieved from a non-transitory computer-readable medium, the program code comprising instructions configured to cause: selecting a subset of entries of a log file based on data indicating approved entry types, the entries having data fields, the entries corresponding to user events associated with a plurality of users of tenants in a multi-tenant database system; selecting a subset of the data fields of the subset of the entries based on data indicating approved data fields of a corresponding entry type; and generating a plurality of customer-facing log files based on a client identifier data field in each entry in the subset of the entries, each of the customer-facing log files being associated with a corresponding tenant of the multi-tenant database system, and each of the customer-facing log files comprising the subset of the data fields of the subset of the entries associated with the corresponding tenant.
 20. The computer program product of claim 19, wherein the data indicating approved entry types and the data indicating approved data fields are provided by a data file comprising attributions of entry types and data fields indicating the approved entry types and approved data fields.
 21. The computer program product of claim 19, the instructions further configured to cause: storing pointers associated with the plurality of customer-facing log files as at least one data object in a database, wherein each of the pointers indicates a location of a corresponding customer-facing log file. 